MaltParser and LIBLINEAR Transition-based dependency parsing with linear classification for feature model optimization
نویسنده
چکیده
In this thesis, MaltParser has been extended with an interface to a software for large-scale linear classification (LIBLINEAR). This combination was then used for learning and parsing with four different treebanks (Slovene, Danish, Arabic, and Turkish). The initial tests compared the accuracy of MaltParser using LIBLINEAR to that when using LibSVM (MaltParser’s default classifier) with its linear algorithm. The results are significantly different in favor of LibSVM for two of the four treebanks (Danish and Arabic). The LIBLINEAR classifier was then used for optimizing a feature model for each of the four treebanks. These results showed a significant improvement compared to the baseline feature model both when using only LIBLINEAR, and when using the optimized feature model with LibSVM. Learning and parsing times are much smaller when using LIBLINEAR, though accuracy is not quite as high as that of LibSVM’s polynomial kernel. In conclusion, using LIBLINEAR for feature model optimization is a good idea, since it takes advantage of the speed of LIBLINEAR while not sacrificing accuracy when using LibSVM’s polynomial kernel for the final feature model. This also indicates that the MaltParser/LIBLINEAR combination should be explored further, and preferably with larger treebanks.
منابع مشابه
MaltOptimizer: A System for MaltParser Optimization
Freely available statistical parsers often require careful optimization to produce state-of-the-art results, which can be a non-trivial task especially for application developers who are not interested in parsing research for its own sake. We present MaltOptimizer, a freely available tool developed to facilitate parser optimization using the open-source system MaltParser, a data-driven parser-g...
متن کاملAn improved joint model: POS tagging and dependency parsing
Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...
متن کاملMaltParser: A Data-Driven Parser-Generator for Dependency Parsing
We introduce MaltParser, a data-driven parser generator for dependency parsing. Given a treebank in dependency format, MaltParser can be used to induce a parser for the language of the treebank. MaltParser supports several parsing algorithms and learning algorithms, and allows user-defined feature models, consisting of arbitrary combinations of lexical features, part-of-speech features and depe...
متن کاملMaltParser at the EVALITA 2009 Dependency Parsing Task
This paper describes our participation in the EVALITA 2009 Dependency Parsing Task with a version of MaltParser. Reusing feature models developed in the CoNLL shared task 2007, we evaluated four different parsing algorithms implemented in MaltParser and found that the best results were achieved with Covington’s non-projective parsing algorithm. In the final evaluation, our system finished third...
متن کاملAn Ensemble Model for the EVALITA 2011 Dependency Parsing Task
This paper describes our participation in the EVALITA 2011 Dependency Parsing Task with an ensemble model. In the 2009 edition we participated with MaltParser, reusing feature models developed in the CoNLL shared task 2007. In 2011, we compared the results obtained by different parsing algorithms implemented in MaltParser with an ensemble model made available by Mihai Surdeanu. The best results...
متن کامل